<p>The vast majority of real-world reinforcement learning systems use an interactive contextual bandits approach. This approach enables people and organizations to learn and adapt continuously.</p>

<p>For an overview of a contextual bandit approach to reinforcement learning and how to use Vowpal Wabbit, see the <a href="/tutorials/contextual_bandits.html">contextual bandit reinforcement learning tutorial</a>.</p>
